Dimensionality estimation without distances

نویسندگان

  • Matthäus Kleindessner
  • Ulrike von Luxburg
چکیده

Theorem (Consistency of EDP and ECAP) Let the regularity assumptions hold. Let D = {x1, . . . , xn} ⊆ X be an i.i.d. sample from f and G be the directed, unweighted kNN-graph on D. Given G as input and a vertex i ∈ {1, . . . , n} chosen uniformly at random, both EDP({i}) and ECAP({i}) converge to the true dimension d in probability as n→∞ if k = k(n) satisfies k ∈ o(n), logn ∈ o(k), and there exists k′ = k′(n) with k′ ∈ o(k) and logn ∈ o(k′). How can we choose k and k′? For example: k = (logn)1+τ and k′ = (logn)1+τ/2 for some τ > 0

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dimensionality Estimation, Manifold Learning and Function Approximation using Tensor Voting

We address instance-based learning from a perceptual organization standpoint and present methods for dimensionality estimation, manifold learning and function approximation. Under our approach, manifolds in high-dimensional spaces are inferred by estimating geometric relationships among the input instances. Unlike conventional manifold learning, we do not perform dimensionality reduction, but i...

متن کامل

Intrinsic Dimensionality Estimation in Visualizing Toxicity Data

Over the years, a number of dimensionality reduction techniques have been proposed and used in chemo informatics to perform nonlinear mappings. Nevertheless, data visualization techniques can be efficiently applied for dimensionality reduction mainly in a case if the data are not really high-dimensional and can be represented as a nonlinear low-dimensional manifold when it is possible to reduce...

متن کامل

Location and dimensionality estimation of geological bodies using eigenvectors of "Computed Gravity Gradient Tensor"

One of the methodologies employed in gravimetry exploration is eigenvector analysis of Gravity Gradient Tensor (GGT) which yields a solution including an estimation of a causative body’s Center of Mass (COM), dimensionality and strike direction. The eigenvectors of GGT give very rewarding clues about COM and strike direction. Additionally, the relationships between its components provide a quan...

متن کامل

Enhanced Estimation of Local Intrinsic Dimensionality Using Auxiliary Distances

Estimating Intrinsic Dimensionality (ID) is of high interest in many machine learning tasks, including dimensionality reduction, outlier detection, similarity search and subspace clustering. Our proposed estimation strategy, ALID, makes use of a subset of the available intra-neighborhood distances to achieve faster convergence with fewer samples, and can thus be used on applications in which th...

متن کامل

2D Dimensionality Reduction Methods without Loss

In this paper, several two-dimensional extensions of principal component analysis (PCA) and linear discriminant analysis (LDA) techniques has been applied in a lossless dimensionality reduction framework, for face recognition application. In this framework, the benefits of dimensionality reduction were used to improve the performance of its predictive model, which was a support vector machine (...

متن کامل

Energy-aware adaptive Johnson-Lindenstrauss embedding via RIP-based designs

We consider a dimensionality reducing matrix design based on training data with constraints on its Frobenius norm and number of rows. Our design criteria is aimed at preserving the distances between the data points in the dimensionality reduced space as much as possible relative to their distances in original data space. This approach can be considered as a deterministic Johnson-Lindenstrauss e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015